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Abstract 

We provide an analytic, microscopic analysis of extreme events in 
an adaptive population comprising competing agents (e.g. species, cells, 
traders, data-packets). Such large changes tend to dictate the long-term 
dynamical behaviour of many real-world systems in both the natural and 
social sciences. Our results reveal a taxonomy of extreme events, and pro- 
vide a microscopic understanding as to their build-up and likely duration. 
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Large unexpected changes or 'extreme events' (e.g. crashes in financial mar- 
kets, or punctuated equilibria in evolution) happen infrequently, yet tend to dic- 
tate the long-term dynamical behaviour of real-world systems in disciplines as 
diverse as biology and economics, through to ecology and evolution. The ability 
to generate large internal, so-called endogenous changes is a defining character- 
istic of complex systems, and arguably of Nature and Life itself ^, ^. Such 
changes are manifestations of subtle, short-term temporal correlations resulting 
from internal collective behaviour. They seem to appear out of nowhere and 
have long-lasting consequences. To what extent can they ever be 'understood'? 
Followers of the self-organized criticality view would claim this question is 
naive because of an inherent self-similarity in Nature: any large changes are 
simply magnified versions of smaller changes, which are in turn magnified ver- 
sions of even smaller changes, and so on. Such self-similarity is presumed to 
underlie the power-law scaling observed in natural, social and economic phe- 
nomena 1^. However, there are reasons for believing that the largest changes 
may be 'special' in a microscopic sense Power-law scaling is only approx- 
imately true, and does not apply over an infinite range of scales. Apart from 
being atomistic at the smallest scale, a population of competing agents can- 
not cause any effect larger than the population size itself: in short, the largest 
changes will tend to 'scrape the barrel' in some way. Reference |^ quotes Bacon 
from Novum Organum: "Whoever knows the ways of Nature will more easily 
notice her deviations; and, on the other hand, whoever knows her deviations 
will more accurately describe her ways" . 

This paper addresses the task of understanding, and eventually controlling, 
the large endogenous changes arising in a complex adaptive system comprising 
competing agents (e.g. species, cells, traders, data-packets). Our work reveals a 
taxonomy of large changes, and provides a quantitative microscopic description 
of their build-up and duration. Our results also provide insight into how a 
'complex systems manager' might contain or control such extreme events. 

We consider a generic complex system in which a population of Nfot hetero- 
geneous agents with limited capabilities and information, repeatedly compete 
for a limited global resource. Our model was introduced in Ref. Q, and is a 
generalization of the El Farol bar problem and the Minority game, concerning 
a population of people deciding whether to attend a popular bar with limited 
seating ||]. At timestep t, each agent (e.g. a bar customer, or a market trader) 
decides whether to enter a game where the choices are option 1 (e.g. attend the 
bar, or buy) and option (e.g. go home, or sell): Nq agents choose while A^i 
choose 1. The 'excess demand' D[t\ = iVi — A'o (which mimics price-change in a 
market) and number V[t\ — Ni + Nq of active agents (which mimics volume of 
market orders) represent output variables. These two quantities fluctuate with 
time, and can be combined to construct other global quantities of interest for the 
complex system studied (e.g. summing the price-changes gives the current price). 
This model can reproduce statistical and dynamical features similar to those in a 
real- world complex adaptive system, namely a financial market Q , and exhibits 
the crucial feature of seemingly spontaneous large changes of variable dura- 
tion P, pj. The resulting time-series appears 'random' yet is non-Markovian, 
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with subtle temporal correlations which put it beyond any random-walk based 
description. The temporal correlations of price-changes and volume, and their 
cross-correlation, are of intense interest in financial markets where so-called 
chartists offer a wide range of rules-of-thumb Q] such as 'volume goes with price 
trend'. Although such rules are unreliable, the intriguing question remains as 
to whether there could in principle be a 'science of charting'. 

A subset V[t] < Ntot of the population, who are sufficiently confident of 
winning, are active at each timestep. For A^i < Nq the winning decision is 1 
and vice-versa, i.e. the winning decision is given by H\—D[t^ where H[x\ is the 
Heaviside function. The global resource level is so limited, or equivalently the 
game is so competitive, that at least half the active population lose at each 
timestep The only global information available to the agents is a common 
bit-string 'memory' of the m most recent outcomes. Consider m — 2; the 
P = 2™ — A possible history bit-strings are 00, 01, 10 and 11, which can also 
be represented in decimal form: ^ e {0, 1, . . . , P — 1}. A strategy consists 
of a response, a'^ e {—1,1} to each possible bit-string /i, = 1 option 
1, and = —1 ^ option 0. Hence there are 2^ = 16 possible strategies. 
The heterogeneous agents randomly pick s strategies each at the outset, and 
update the scores of their strategies after each timestep with the reward function 
x[D] = sgn[— D], i.e. -1-1 for choosing the minority action, —1 for choosing the 
majority action. Agents have a time horizon T over which strategy points are 
collected, and a threshold level r which mimics a 'confidence'. Only strategies 
having > r points are used, with agents playing their highest scoring strategy. 
Agents with no such strategy become temporarily inactive Q . We focus on the 
regime where the number of strategies in play is comparable to the total number 
available, since this yields seemingly random dynamics with occasional large 
movements ^. The coin-tosses used to resolve ties in decisions (i.e. A'o = 
A^i) and active-strategy scores, inject stochasticity into the game's evolution. 
Reference Q showed that a simplified version of this system in the limit r — > — oo 
and T ^ oo, can be usefully described as a stochastically disturbed deterministic 
system. We are interested in the dynamics of large changes, and adopt the 
approach and terminology of Ref. [^. Averaging over our model's stochasticity 
yields a description of the game's deterministic dynamics via mapping equations 
for the strategy score vector S}t\ and global information For s ~ 2 the 

deterministic dynamics are given exactly by the following equations: 

t-i 

^W=i£[0]- «'^''''sgn[i?[z]], (1) 

i=t-T 

^i[t] = 2n[t ~l]-PH [^i[t - 1] - P/2] + H [D[t - 1]] . 
The corresponding demand function is given by 

2P 2P 
R=l R' = l 

where ^ is the symmetrized strategy allocation matrix which constitutes the 
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quenched disorder present during the system's evolution Elements '^b.^r' 
enumerate the number of agents holding both strategy R and R' . The volume 
V[t] is given by the same expression as D[t] replacing a^'*' by unity. 

Large changes such as financial market crashes, seem to exhibit a wide range 
of possible durations and magnitudes making them difficult to capture using 
traditional statistical techniques based on one or two-point probability distri- 
butions A common feature, however, is an obvious trend (i.e. to the eye) in 
one direction over a reasonably short time window: we use this as a working def- 
inition of a large change. In fact, all the large changes discussed here represent 
> 3ct events. In both our model and the real-world system, these large changes 
arise more frequently than would be expected from a random- walk model ||l|, 2|. 
Our model's dynamics can be described by trajectories on a de Bruijn graph |3|: 
see Fig. 1 for m = 3, with a transition incurring an increment to the score vector 
S_. There are P orthogonal increment vectors g/^, one for each node /i. Setting 
the initial scores S_[0] = 0, the strategy score vector in Eq. (|]) can be written 
exactly as: 



p-1 



S[t] = coa" + cia^ + . . . + cp-ia^ ^ = ^ CjoP 



where Cj represents the nodal weights for history node fJ. ~ j. The nodal weights 
enumerate the number of negative return transitions from node fi minus the 
number of positive return transitions, in the time window t — T — > t — 1. 
High absolute nodal weight implies persistence in transitions from that node 
i.e. persistence in D\^. Large changes will occur when connected nodes become 
persistent. The simplest type of large movement exhibiting perfect nodal per- 
sistence would be /i = 0, 0, 0, 0, . . . in which all successive price changes are in 
the same direction. We call this a 'fixed-node crash' (or rally). However, there 
are many other possibilities reflecting the wide range of forms and durations of 
the large change. For example, on the m = 3 de Bruijn graph in Fig. 1 the 
cycle /i = 0, 0, 1, 2, 4, 0, . . . has four out of the five transitions producing price- 
changes of the same sign (it is persistent on nodes 1, 2, 4 and antipersistent 
on node 0). We call this a 'cyclic- node crash' (or rally). Figure 2 illustrates 
a large change which starts as a fixed-node crash then subsequently becomes 
a cyclic-node crash. Cyclic-node crashes can be treated simply as interlocking 
fixed-node crashes, hence for clarity we focus here on a single fixed-node crash 
(or rally). For the parameter ranges of interest, the choice about whether a 
strategy is played by an agent is more determined by whether that strategy's 
score is above the threshold, than whether it is their highest-scoring strategy Q. 
This is because agents are only likely to have at most one strategy whose score 
lies above the threshold for confidence levels r > 0. Making the additional 
numerically-justified approximation of small quenched disorder (i.e. the vari- 
ance of the entries in the strategy allocation matrix is smaller than their 



4 



mean for the parameter range of interest [p|), the demand and volume become: 



2P 
R=l 

? P 

N N 4-^ 



(2) 
(3) 



i?=i 



Suppose persistence on node fj, ~ starts at time io- How long will the 
resulting crash last? To answer this, we decompose Eq. (||) into strategies which 
predict 1 at /Lt = 0, and those that predict 0. We first consider the particular 
case where the node = was not visited during the previous T timesteps, 
hence the loss of score increment from time-step t — T will not affect S\t] on 
average. At any later time to + r during the crash, (i.e. /i = 0) Eqs. (||) and (||) 
are hence given by: 



D[to 



N 



sgn[S'flJio] - r-r] - 




sgn[S';^[to] - r - r] 



sgn[S'flJto] - ■ 



sgn[S'i^[to] - ■ 



(4) 



\D[to + t]\ decreases as the persistence time r increases, and hence the crash 
ends at time to + Tc when the right-hand side of Eq. (^) changes sign. The 
persistence time or 'crash-length' Tc is thus given by the mean of the scores 
of the strategies predicting 0, i.e. Tc ~ S]i^a'^=-i[to] = —co[to]- In the more 
general case where the node /i = was visited during the previous T timesteps, 
Tc is given by the largest r value which satisfies: 



T= \ coN +5]sgn[D[t']; 
ft'} 



where {t'} 3 {^[t'] = ODto-T <t' < to + r- 
near-Normal distribution, i.e. Sj^^^^t^ ^_i[to] 



T) . Assume that the scores have a 
•-^ N[S'_i, cr] as in Fig. 3a. For each 



strategy R there exists an anticorrelated strategy R and hence SR[t] — — 5;^[t] 
for all t. Consequently, prior to a crash, the score distribution tends to split 
into two halves as indicated schematically in Fig. 3a. The expected demand 
(and volume) during the crash are then: 



< D[to -I- r] > oc erf 



Co [^o] + r + T 



< V[t() + r] > oc 2 - erf 



V2a 
Co [to] + r + T 
V2a 



- erf 



-co[<o] + r - T 



— erf 



V2a 

-co[fo] + r - T 
V2a 
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These forms are illustrated in Figure 3b. As the spread in the strategy score 
distribution is increased, the dependence of < D > and < ^ > on the parame- 
ters T and r becomes weaker and the surfaces flatten out leading to a smoother 
drawdown, as opposed to a sudden severe crash. As the parameters 5-1, a, r are 
varied, it can be seen that the behaviour of the demand and volume during the 
crash can exhibit markedly different qualitative forms yielding a taxonomy 
of different species of large change even within the same single-node family. This 
result could explain why financial market chartists' rules-of-thumb [Q, such as 
'volume goes with price trend', are far too simplistic. 

We now turn to the important practical question of whether history will 
repeat itself, i.e. given that a crash has recently happened, is it likely to happen 
again? If so, is it likely to be even bigger? Suppose the system has built up 
a negative nodal weight for ^ — at some point in the game (see Fig. 4a). 
It then hits node /x = at time to producing a crash (Fig. 4b). The nodal 
weight Co is hence restored to zero (Fig. 4c). In this model the previous build- 
up is then forgotten because of the finite T score window, hence cq becomes 
positive (Fig. 4d). The system then corrects this imbalance (Fig. 4e), restoring 
Co to 0. The crash is then forgotten, hence Co becomes negative (Fig. 4f). The 
system should therefore crash again - however, a crash will only re-appear if the 
system's trajectory subsequently returns to node /i = 0. Interestingly, we find 
that the disorder in the initial distribution of strategies among agents (i.e. the 
quenched disorder in ^) can play a deciding role in the issue of crash 'births and 
revivals' since it leads to a slight bias in the outcome, and hence the subsequent 
transition, at each node. When c^[t] — (see Fig. 4c), it follows that sgn[I?[t]] 

is more likely to be equal to sgn [a'^[*l • x] where x = Y^^iv^i —w ^ strategy 
weight vector with xr corresponding to the number of agents who hold strategy 
R The quenched disorder therefore provides a crucial bias for determining 
the future trajectory on the de Bruijn graph when the nodal weight is small, 
and hence can decide whether a given crash recurs or simply disappears. The 
quenched disorder also provides a catalyst for building up a very large crash. 

Our work opens up the study of how a 'complex-systems-manager' might 
use this information to control the long-term evolution of a complex system by 
introducing, or manipulating, such large changes. As an example, we give a 
quick three-step solution to prevent large changes: (1) use the past history of 
outcomes to build up an estimate of the score vector S_[t] and the nodal weights 
{c^[t]} on the various critical nodes, such as ii — in the case of the fixed-node 
crash. (2) Monitor these weights to check for any large build-up. (3) If such 
a build-up occurs, step in to prevent the system hitting that node until the 
weights have decreased. 
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Figure Captions 

Figure 1: Dynamical behaviour of the global information is described by 

transitions on the dc Bniijn graph. Graph for population of m = 3 agents. 
Blue transitions represent positive demand D, red transitions represent negative 
demand. 

Figure 2: Dynamical behaviour of complex system (e.g. price P \t\ in fi- 
nancial market) described by evolution of nodal weights c^. History at each 

timestep indicated by black square. Large change preceded by abnormally high 
nodal weight. Large change incorporates fixed-node and cyclic node crashes 

Figure 3: (a) Schematic representation of strategy score distribution prior 
to crash. Arrows indicate subsequent motion during crash period, (b) Plots 
of expected demand and volume during crash period showing range of different 
possible behaviour as system parameters are varied. 

Figure 4: Representation of how large changes can recur due to finite memory 
of agents. Grey area shows history period outside agents' memory. Example 
shows recurring fixed-node crash at node /U = 0. 
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transition incurs an increment 
to the score vector S: 
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Stable behavior: path with all transitions equally visited 

eg 0^0^1^3^6^5^3^7^7^6^4^1^ 

Crash: path with many negative (positive) return transitions 
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